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SURVEILLANCE SYSTEM AND METHOD 
HAVING AN ADAPTIVE SEQUENTIAL 
PROBABILITY FAULT DETECTION TEST 

STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH AND 
DEVELOPMENT 

The invention described herein was made in the perfor- 
mance of work under NASA Small Business Technology 
Transfer Research (STTR) Contract NAS8 -98027, and is 
subject to the provisions of Public Law 96-517 (35 USC 
202) and the Code of Federal Regulations 48 CFR 52.227-11 
as modified by 48 CFR 1852.227-11, in which the contractor 
has elected to retain title. 

The United States Government has rights in this invention 
pursuant to Contract No. W-3 1-109 -ENG-38 between the 
United States Government and the University of Chicago 
representing Argonne National Laboratory. 

FIELD OF THE INVENTION 

The instant invention relates generally to a system and 
method for process fault detection using a statistically based 
decision test and, in particular, to a system and method for 
performing high sensitivity surveillance of an asset such as 
a process and/or apparatus wherein the surveillance is per- 
formed using an adaptive sequential probability (ASP) fault 
detection test comprised of a probability density function 
model empirically derived from a numerical analysis of 
asset operating data. 

BACKGROUND OF THE INVENTION 

Conventional process surveillance schemes are sensitive 
only to gross changes in the mean value of a process signal 
or to large steps or spikes that exceed some threshold limit 
value. These conventional methods suffer from either a large 
number of false alarms (if thresholds are set too close to 
normal operating levels) or from a large number of missed 
(or delayed) alarms (if the thresholds are set too 
expansively). Moreover, most conventional methods cannot 
perceive the onset of a process disturbance or sensor signal 
error that gives rise to a signal below the threshold level or 
an alarm condition. Most conventional methods also do not 
account for the relationship between measurements made by 
one sensor relative to another redundant sensor or between 
measurements made by one sensor relative to predicted 
values for the sensor. 

Recently, improved methods for process surveillance 
have developed from the application of certain aspects of 
artificial intelligence technology. Specifically, parameter 
estimation methods have been developed using either 
statistical, mathematical or neural network techniques to 
learn a model of the normal patterns present in a system of 
process signals. After learning these patterns, the learned 
model is used as a parameter estimator to create one or more 
predicted (virtual) signals given a new observation of the 
actual process signals. Further, high sensitivity surveillance 
methods have been developed for detecting process and 
signal faults by analysis of a mathematical comparison 
between an actual process signal and its virtual signal 
counterpart. In particular, such a mathematical comparison 
is most often performed on a residual error signal computed 
as, for example, the difference between an actual process 
signal and its virtual signal counterpart. 

Parameter estimation based surveillance schemes have 
been shown to provide improved surveillance relative to 
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conventional schemes for a wide variety of assets including 
industrial, utility, business, medical, transportation, 
financial, and biological systems. However, parameter esti- 
mation based surveillance schemes have in general shown 
5 limited success when applied to complex processes. Appli- 
cants recognize and believe that this is because the param- 
eter estimation model for a complex process will, in general, 
produce residual error signals having a non-Gaussian prob- 
ability density function. Moreover, a review of the known 
prior-art discloses that virtually all such surveillance sys- 
tems developed to date utilize or assume a Gaussian model 
of the residual error signal probability density function for 
fault detection. Hence, a significant shortcoming of the 
known prior- art is that, inter alia, parameter estimation 
based surveillance schemes will produce numerous false 
15 alarms due to the modeling error introduced by the assump- 
tion of a Gaussian residual error signal probability density 
function. The implication for parameter estimation based 
surveillance schemes is that the fault detection sensitivity 
must be significantly reduced to prevent false alarms thereby 
20 limiting the utility of the method for process surveillance. 
An alternative for statistically derived fault detection models 
is to mathematically pre-process the residual error signals to 
remove non-Gaussian elements prior to using the residual 
error signals in the fault detection model; however this 
25 approach requires an excess of additional processing and 
also limits the sensitivity of the surveillance method. 
Therefore, the implication of assuming a Gaussian residual 
error signal probability density function for a parameter 
estimation based surveillance scheme is simply that the 
system becomes less accurate thereby degrading the sensi- 
tivity and utility of the surveillance method. 

Many attempts to apply statistical fault detection tech- 
niques to surveillance of assets such as industrial, utility, 
business, medical, transportation, financial, and biological 
processes have met with poor results in part because the fault 
35 detection models used or assumed a Gaussian residual error 
signal probability density function. 

In one specific example, a multivariate state estimation 
technique based surveillance system for the Space Shuttle 
Main Engine’s telemetry data was found to produce numer- 
40 ous false alarms when a Gaussian residual error fault detec- 
tion model was used for surveillance. In this case, the 
surveillance system’s fault detection threshold parameters 
were desensitized to reduce the false alarm rate; however, 
the missed alarm rate then became too high for practical use 
45 in the telemetry data monitoring application. 

Moreover, current fault detection techniques for surveil- 
lance of assets such as industrial, utility, business, medical, 
transportation, financial, and biological processes either fail 
to recognize the surveillance performance limitations that 
50 occur when a Gaussian residual error model is used or, 
recognizing such limitations, attempt to artificially conform 
the observed residual error data to fit the Gaussian model. 
This may be attributed, in part, to the relative immaturity of 
the field of artificial intelligence and computer- assisted 
55 surveillance with regard to real-world process control appli- 
cations. Additionally, a general failure to recognize the 
specific limitations that a Gaussian residual error model 
imposes on fault decision accuracy for computer- assisted 
surveillance is punctuated by an apparent lack of known 
60 prior art teachings that address potential methods to over- 
come this limitation. In general, the known prior-art teaches 
computer-assisted surveillance solutions that either ignore 
the limitations of the Gaussian model for reasons of math- 
ematical convenience or attempt to conform the actual 
65 residual error data to artificial Gaussian model, for example, 
by using frequency domain filtering and signal whitening 
techniques. 
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For the foregoing reasons, there is a need for a surveil- 
lance system and method that overcomes the significant 
shortcomings of the known prior-art as delineated herein- 
above. 

SUMMARY OF THE INVENTION 

The instant invention is distinguished over the known 
prior art in a multiplicity of ways. For one thing, the instant 
invention provides a surveillance system and method having 
a fault detection model of unconstrained probability density 
function form and having a procedure suitable for overcom- 
ing a performance limiting trade-off between probability 
density function modeling complexity and decision accu- 
racy that has been unrecognized by the known prior-art. 
Specifically, the instant invention can employ any one of a 
plurality of residual error probability density function model 
forms, including but not limited to a Gaussian form, thereby 
allowing a surveillance system to utilize the model form best 
suited for optimizing surveillance system performance. 

Moreover, the instant invention provides a surveillance 
system and method that uses a computer-assisted learning 
procedure to automatically derive the most suitable form of 
the residual error probability density function model by 
observation and analysis of a time sequence of process 
signal data and by a combination of a plurality of techniques. 
This ability enables surveillance to be performed by the 
instant invention with lower false alarm rates and lower 
missed alarm rates than can be achieved by the known 
prior- art systems and methods. 

Thus, the instant invention provides a surveillance system 
and method that performs its intended function much more 
effectively by enabling higher decision accuracy. Further, 
the instant invention is suitable for use with a plurality of 
parameter estimation methods thereby providing a capabil- 
ity to improve the decision performance of a wide variety of 
surveillance systems. 

In one preferred form, the instant invention provides a 
surveillance system and method that creates and uses, for the 
purpose of process surveillance, a multivariate state estima- 
tion technique parameter estimation method in combination 
with a statistical hypothesis test fault detection method 
having a probability density function model empirically 
derived from a numerical analysis of asset operating data. 

Particularly, the instant invention provides a surveillance 
system and method for providing surveillance of an asset 
such as a process and/or apparatus by providing training and 
surveillance procedures. 

In accordance with the instant invention, the training 
procedure is comprised of a calibrate parameter estimator 
procedure and a calibrate fault detector procedure. 

The calibrate parameter estimator procedure creates a 
parameter estimation model(s) and trains a parameter esti- 
mation model by utilizing training data correlative to 
expected asset operation and, for example, utilizing a mul- 
tivariate state estimation technique (MSET) procedure. The 
calibrate parameter estimator procedure further stores this 
model as a new parameter estimation model in a process 
model. The process model may contain one or more param- 
eter estimation models depending upon design requirements. 
Additionally, each parameter estimation model contained in 
the process model may be created to implement any one of 
a plurality of parameter estimation techniques. 

The calibrate fault detector procedure makes use of the 
parameter estimation models to provide estimated values for 
at least one signal parameter contained in the training data. 
Generally, the calibrate fault detector procedure will create 
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a separate and distinct fault detection model for each sensor 
or data signal associated with the asset being monitored for 
the presence of fault conditions during the surveillance 
procedure. 

5 The calibrate fault detector procedure includes, for 
example, a method of fitting a standard Gaussian probability 
density function (PDF) to a training data residual distribu- 
tion (computed as a function of estimated process param- 
eters and the training data) and then adding successive 
10 higher order terms of a remainder function to the standard 
Gaussian PDF for the purpose of defining a general PDF that 
better fits the computed training data residual distribution. 
Other techniques for fitting a general PDF to the training 
data are similarly feasible and useful in accordance with the 
15 instant invention and include, for example, a technique for 
fitting a polynomial function to the training data. 

The training procedure is completed when all training 
data has been used to calibrate the process model. At this 
point, the process model preferably includes parameter 
20 estimation models and fault detection models for each 
sensor or data signal associated with the asset being moni- 
tored for the presence of fault conditions during the surveil- 
lance procedure. The process model is thereafter used for 
performing surveillance of the asset. 

The surveillance procedure is performed using an adap- 
tive sequential probability (ASP) fault detection test com- 
prised of the general probability density function model 
empirically derived from a numerical analysis of the asset 
3Q training data. 

The surveillance procedure acquires and digitizes current 
asset data and then estimates process parameters as a func- 
tion of the acquired digitized current asset data and the 
parameter estimation model(s) obtained from the calibrate 
35 parameter estimator procedure. Then, fault detection is 
determined by first computing data residuals as a function of 
the estimated process parameters and the acquired digitized 
current asset data and then performing the ASP test(s) as a 
function of the fault detection models and thus, as a function 
40 of the fitted general PDF obtained in the calibrate fault 
detector procedure. Each ASP test returns one of three 
possible states: a not null state which rejects the probability 
that a null hypothesis is true and excepts an alternative 
hypothesis correlative to unexpected operation of the asset; 
45 a null state which accepts the probability that a null hypoth- 
esis is true and excepts the null hypothesis correlative to 
expected operation of the asset; and an in-between state 
which excepts neither the null hypothesis nor the alternative 
hypothesis as being true and requires more data to reach a 
50 conclusion. 

The results of the fault detection procedure are then 
analyzed according to the instant invention such that an 
alarm and/or a control action are taken when the analysis 
determines that the results indicate unexpected operation of 
55 the asset. 

Hence, the instant invention is distinguished over the 
known prior-art by providing a surveillance system and 
method for performing surveillance of an asset by acquiring 
residual data correlative to expected asset operation; fitting 
60 an equation to said acquired residual data; storing said fitted 
equation in a memory means; collecting a current set of 
observed signal data values from the asset; using said fitted 
equation in a sequential hypotheses test to determine if said 
current set of observed signal data values indicate unex- 
65 pected operation of the asset indicative a fault condition; and 
outputting a signal correlative to each detected fault condi- 
tion for providing asset surveillance. 
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OBJECTS OF THE INVENTION 

Accordingly, the instant invention provides a new, novel 
and useful surveillance system and method having an adap- 
tive sequential probability fault detection test. 

In one embodiment of the instant invention the surveil- 
lance system and method includes an unconstrained form of 
a residual error probability density function model used in 
said surveillance system’s fault detection method. 

In one embodiment of the instant invention the surveil- 
lance system and method can perform high sensitivity 
surveillance of a wide variety of assets including industrial, 
utility, business, medical, transportation, financial, and bio- 
logical processes and apparatuses wherein such process 
and/or apparatus asset preferably has at least one pair of 
redundant actual and/or virtual signals. 

In one embodiment of the instant invention the surveil- 
lance system and method includes a statistical hypothesis 
test surveillance decision procedure that uses a fault detec- 
tion model comprised of a probability density function 
model of a residual error signal that is of an unconstrained 
form. 

In one embodiment of the instant invention the surveil- 
lance system and method creates an improved fault detection 
model for a process surveillance scheme using recorded 
operating data for an asset to train a fault detection model. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for surveillance of on-line, real-time signals, or 
off-line accumulated signal data. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for surveillance of signal sources and detecting a 
fault or error state of the signal sources enabling responsive 
action thereto. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for surveillance of signal sources and detecting a 
fault or error state of the asset processes and apparatuses 
enabling responsive action thereto. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved decision as 
to the accuracy or validity for at least one process signal 
parameter given an observation of at least one actual signal 
from the asset. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a 
multivariate state estimation technique (MSET) having any 
one of a plurality of pattern recognition matrix operators, 
training procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a neural 
network having any one of a plurality of structures, training 
procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
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wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a 
mathematical process model having any one of a plurality of 
structures, training procedures, and operating procedures. 

5 In one embodiment of the instant invention the surveil- 
lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
1° generation of at least one virtual signal parameter is an 
autoregressive moving average ( ARMA) model having any 
one of a plurality of structures, training procedures, and 
operating procedures. 

In one embodiment of the instant invention the surveil- 
15 lance system and method provides an improved system and 
method for ultra-sensitive detection of a fault or error state 
of signal sources and/or asset processes and apparatuses 
wherein the parameter estimation technique used for the 
generation of at least one virtual signal parameter is a 
20 Kalman filter model having any one of a plurality of 
structures, training procedures, and operating procedures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
25 method for using at least one of a plurality of methods to 
classify the state of a residual error signal produced by the 
mathematical difference between two signals, said two sig- 
nals being either actual and/or predicted signals, into one of 
at least two categories. 

30 In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
method to classify the state of a residual error signal wherein 
said classification is made to distinguish between a normal 
signal and a abnormal signal. 

35 In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
method to classify the state of a residual error signal wherein 
said classification is performed using a statistical hypothesis 
test having any one of a plurality of probability density 
40 function models, training procedures, and operating proce- 
dures. 

In one embodiment of the instant invention the surveil- 
lance system and method provides a novel system and 
method to classify the state of a residual error signal wherein 
45 said classification is performed using a probability density 
function model having any one of a plurality of structures, 
training procedures, and operating procedures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

50 FIG. 1 is a block diagram of a surveillance system of a 
preferred embodiment in accordance with the instant inven- 
tion. 

FIG. 2 is a schematic functional flow diagram of a 
preferred embodiment in accordance with the instant inven- 
tion. 

FIG. 3 is a schematic functional flow diagram of a 
preferred method and system for training a process model 
consisting of at least one parameter estimation model and at 
60 least one fault detection model using recorded observations 
of the actual process signals in accordance with the instant 
invention. 

FIG. 4 is a schematic functional flow diagram of a method 
and system for the fault detection model training procedure 
65 in accordance with the instant invention. 

FIG. 5 is a schematic functional flow diagram of a 
preferred method and system for performing surveillance of 
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an asset using at least one parameter estimation model and 
at least one fault detection model in accordance with the 
instant invention. 

FIG. 6 is a schematic functional flow diagram of a method 
and system for the fault detection surveillance procedure in 
accordance with the instant invention. 

FIG. 7 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a redundant sensor. 

FIG. 8 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a generalized parameter estimation model, 
such as a multivariate state estimation model, a neural 
network model, an analytical model, or a Kalman filter 
model. 

FIG. 9 is a flow diagram of a training and surveillance 
procedure in accordance with the instant invention. 

FIG. 10 illustrates characteristics of the null and alternate 
hypotheses for the prior-art sequential probability ratio test 
(SPRT) mean tests. 

FIG. 11 illustrates characteristics of the null and alternate 
hypotheses for the prior- art SPRT variance tests. 

FIG. 12 illustrates acquired operating data, estimated 
parameter data, and residual error data for a typical Space 
Shuttle Main Engine accelerometer. 

FIG. 13 illustrates a probability density function of the 
residual error data for a typical Space Shuttle Main Engine 
accelerometer with comparison to a Gaussian probability 
density function. 

FIG. 14 illustrates an un-optimized one- term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 15 illustrates an un-optimized two-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 16 illustrates an un-optimized three-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 17 lists the root mean square errors for five different 
un-optimized probability density function models computed 
for each of six Space Shuttle Main Engine accelerometers 
used for feasibility testing of a preferred embodiment in 
accordance with the instant invention. 

FIG. 18 lists the root mean square errors for five different 
optimized probability density function models computed for 
each of six Space Shuttle Main Engine accelerometers used 
for feasibility testing of a preferred embodiment in accor- 
dance with the instant invention. 

FIG. 19 illustrates the optimized two-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 20 illustrates the empirical false alarm rates for 
feasibility testing of a preferred embodiment in accordance 
with the instant invention with comparison to the prior art 
SPRT method. 

DESCRIPTION OF PREFERRED 
EMBODIMENTS 

Considering the drawings, wherein like reference numer- 
als denote like parts throughout the various drawing figures, 
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reference numeral 10 is directed to the system according to 
the instant invention. 

In its essence, and referring to FIGS. 1 and 2, the system 
10 is generally comprised of a method and apparatus for 
5 performing high sensitivity surveillance of a wide variety of 
assets including industrial, utility, business, medical, 
transportation, financial, and biological processes and appa- 
ratuses wherein such process and/or apparatus asset prefer- 
ably has at least one distinct measured or observed signal or 
io sequence comprised of characteristic data values which are 
processed by the system 10 described herein for providing 
ultra-sensitive detection of the onset of sensor or data signal 
degradation, component performance degradation, and pro- 
cess operating anomalies. The system 10 includes a training 
15 procedure 30 carried out on a computer 22 such that a 
process model 50 of an asset 12 (e.g., a process and/or 
apparatus) is stored in an associated memory means 24 after 
being learned from historical operating data using at least 
one of a plurality of computer- assisted procedures in accor- 
20 dance with the instant invention. The historical operating 
data includes a set or range of observations from expected or 
typical operation of the asset 12 that are acquired and 
digitized by a data acquisition means 20 and stored in a 
memory means 24 as training data 34 by using any combi- 
25 nation of electronic data acquisition hardware and signal 
processing software 20 known to those having ordinary skill 
in the art, and informed by the present disclosure. As will be 
delineated in detail infra, one hallmark of the instant inven- 
tion is the process model 50 for the asset 12 that is derived 
30 during the training procedure 30. 

The system 10 further includes a surveillance procedure 
60 wherein the stored process model 50 is used for high 
sensitivity computer- assisted surveillance of the asset 12 for 
the purpose of determining whether a process fault or failure 
35 necessitates an alarm or control action. The process model 
50 is comprised of a parameter estimation model 52 or 
collection of parameter estimation models 52 as necessary to 
provide an estimated value for each sensor or data signal 14 
of asset 12 to be monitored for the presence of fault 
40 conditions during the surveillance procedure 60. The pro- 
cess model 50 is further comprised of a fault detection model 
54 or collection of fault detection models 54 such that at 
least one fault detection model 54 is provided for each 
sensor or data signal 14 of asset 12 to be monitored for the 
45 presence of fault conditions during the surveillance proce- 
dure 60. The fault detection model 54 is another hallmark of 
the instant invention and will be delineated in further detail 
hereinbelow. 

Process Model Training Procedure 
50 More specifically, and referring to FIGS. 1 and 3, the 
training procedure 30 of the system 10 includes a method 
and apparatus for training or preparing the process model 50 
using historical operating data from the asset 12 that has 
been acquired by the data acquisition means 20 using any 
55 combination of conventional electronic data acquisition 
hardware and signal processing software as is well known in 
the art. The historical operating data is acquired in digital 
format and stored in memory means 24 using a data storage 
procedure 32 to create a training data set 34. 

60 The training data set 34 includes at least N discrete 
observations of the asset 12 wherein each single 
observation, herein denoted Xobs, is comprised of a vector 
of data values for each signal parameter to be included in the 
process model 50. For the purposes of the training procedure 
65 30, the number of observations, N, acquired is at least great 
enough to adequately bound the operating state space of the 
asset 12. Thus, the training data set 34 provides a represen- 
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tative sample of the signals produced by the asset 12 during 
all normal modes of operation. 

Upon acquiring the training data set 34, the unique 
training procedure 30 can be implemented in accordance 
with instant invention. 

The unique training procedure 30 is comprised of a 
calibrate parameter estimator procedure 36 and a calibrate 
fault detector procedure 38. The calibrate parameter estima- 
tor procedure 36 creates the parameter estimation model 52 
and trains the parameter estimation model 52 using the 
training data 34. The calibrate parameter estimator proce- 
dure 36 further stores this model as a new parameter 
estimation model 52 in the process model 50. 

The process model 50 may contain one or more parameter 
estimation models 52 depending upon the requirements of 
the approach taken by a designer. Continuing to refer to FIG. 
3, the training procedure 30 may be, in general, performed 
using any parameter estimation method suitable for defining 
a parameter estimation model 52 useful for estimating the 
values of one or more process signals. Methods suitable for 
the calibrate parameter estimator procedure 36 include, but 
are not limited to, a plurality of redundant sensor techniques, 
a plurality of multivariate state estimation techniques, a 
plurality of neural network techniques, a plurality of math- 
ematical model techniques, a plurality of autoregressive 
moving average techniques, and a plurality of Kalman filter 
techniques. Each parameter estimation model 52 contained 
in the process model 50 may be created to implement any 
one of a plurality of parameter estimation techniques. 
Further, the parameter estimation technique implemented 
for an individual parameter estimation model 52 is not 
constrained to be the same as the parameter estimation 
technique implemented for any other parameter estimation 
model 52 contained in the process model 50. 

One example of the calibrate parameter estimator proce- 
dure 36 would be the computation of a bias term between 
two redundant sensors wherein the parameter estimation 
model 52 used for estimating the value of one sensor during 
the surveillance procedure 60 consisted of summing the 
observed value of a second redundant sensor with a bias 
term computed during the training procedure 30 as the mean 
difference between the two sensor values over the training 
data set 34. More sophisticated examples of the training 
procedure 30 using multivariate state estimation techniques 
will be described herein below. 

Still referring to FIG. 3, the calibrate fault detector 
procedure 38 makes use of the parameter estimation models 
52 to provide estimated values for at least one signal 
parameter contained in the training data. Generally, the 
calibrate fault detector procedure 38 will create a separate 
and distinct fault detection model 54 for each sensor or data 
signal of asset 12 to be monitored for the presence of fault 
conditions during the surveillance procedure 60. As delin- 
eated infra, one hallmark of the instant invention is the fault 
detection model 54 element of the process model 50 for the 
asset 12 that is derived during the training procedure 30. In 
particular, the instant invention encompasses a statistical 
hypothesis test type of fault detection model 54 having novel 
and unique characteristics and calibration procedures 
described herein including but not limited to having a 
probability density function model empirically derived from 
a numerical analysis of asset operating data. 

Continuing to refer to FIG. 3, the training procedure 30 is 
completed when all training data 34 has been used to 
calibrate the process model 50. At this point, the process 
model 50 includes parameter estimation models 52 and fault 
detection models 54 for each sensor or data signal of asset 
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12 to be monitored for the presence of fault conditions 
during the surveillance procedure 60. The process model 
array 50 is thereafter useful for performing surveillance of 
the asset 12 . 

5 Referring to FIG. 4, the training procedure 30 is illus- 
trated in additional detail. A designer 16 initializes the 
calibrate parameter estimators procedure 36 by specifying a 
set of parameter estimator methods and settings 40. The 
parameter estimator methods and settings 40 are then used 
10 to operate on the training data 34 via a nominal signal 
behavior modeling procedure 42, for example using an 
MSET training procedure as described herein in detail, in 
order to create the parameter estimation models 52, which 
are stored in the process model 50. 

15 Still referring to FIG. 4, the training procedure 30 next 
proceeds to the calibrate fault detectors procedure 38 
wherein the parameter estimation models 52 are an input to 
the procedure. The designer 16 initializes the calibrate fault 
detectors procedure 38 by specifying a set of fault detector 
20 methods and settings 46. Next, the estimate process param- 
eters procedure 64 operates the parameter estimation models 
52 over the training data 34 to generate an estimated value 
for each monitored signal value contained in the training 
data 34. It is important that the estimate process parameters 
25 procedure 64 used in the calibrate fault detectors procedure 
38 be the same estimate process parameters procedure 64 
that will later be used in the surveillance procedure 60 
(reference FIGS. 1 and 2 for surveillance procedure 60). 
Next, A compute training data residuals procedure 44 cal- 
30 culates the training data residuals for each monitored signal, 
which are calculated as the difference between the training 
data value and the corresponding estimated data value for 
each monitored signal. The training data residuals are next 
used by a compute nominal residual probability density 
35 function (PDF) procedure 48 to create a fault detection 
model 54 for each monitored signal. In accordance with the 
instant invention, the fault detection models 54 is typically 
comprised of mathematical descriptions of the probability 
density function that best characterizes or best fits the 
40 training data residual for the monitored signal. The training 
data is presumed to accurately characterize the expected 
normal operating states of the asset 12. Therefore, the 
training data residuals are characteristic of the expected 
normal deviations between the observed signal values and 
45 the values estimated using the parameter estimation models 
52. The fault detection models 54 are stored in the process 
model 50 thereby completing the training procedure. 

One hallmark of the instant invention is the method and 
system for computing the fault detection models 54 by the 
50 means of the compute nominal residual probability density 
function (PDF) procedure 48. As will be described math- 
ematically herein below, the compute nominal residual prob- 
ability density function (PDF) procedure 48 fits a general 
open-ended probability function to the training data residu- 
55 als and employs this fitted function when implementing a 
herein named Adaptive Sequential Probability (ASP) 
method and system for computing the fault detection model 
54 and thereafter employing said fault detection model 54 
for the purpose of performing a fault detection procedure 66 
60 of the surveillance procedure 60. 

Surveillance Procedure 

Referring to FIG. 5, the surveillance procedure 60 is 
comprised of acquiring successive vectors of current oper- 
ating data and determining for each such observation vector 
65 whether the current operating data is indicative of a fault or 
failure of the asset 12. The surveillance procedure 60 further 
includes implementing an alarm or control action 70 for the 
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purpose of notifying an operator and/or taking a corrective 
action in response to a detected fault or failure of the asset 
12. The surveillance procedure 60 is in general an open- 
ended data acquisition and analysis loop that continues until 
such time as the operator chooses to terminate the surveil- 
lance 74. 

More specifically, and referring to FIG. 5, the surveillance 
procedure 60 begins 58 with an acquire current operating 
data procedure 62 that employs the data acquisition and 
digitization means 20 (FIG. 1) to acquire a current set of 
signal data from the monitored asset 12. The current set of 
signal data is provided to the estimate process parameters 
procedure 64 that uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 
values. The observed and estimated data are next provided 
to a perform fault detection procedure 66 that uses the fault 
detection models 54 to determine whether a fault is found 68 
in the current operating data. If a fault is found 68 is true, an 
alarm and/or control action 70 is taken. Upon completing the 
fault found procedure 68, the surveillance procedure 60 then 
repeats for the next available set of signal data for as long as 
a surveillance complete decision procedure 72 determines 
that additional surveillance data are available or terminates 
at surveillance complete step 74 when no more surveillance 
data are available or when terminated by an operator. 

Referring now to FIG. 6, the perform fault detection 
procedure 66 of surveillance procedure 60 is illustrated in 
additional detail. For each current set of signal data values 
acquired the estimate process parameters procedure 64 uses 
the parameter estimation models 52 to estimate values for 
one or more of the current signal data values. The compute 
data residuals procedure 76 performs a mathematical trans- 
formation on the acquired and estimated values to produce 
a current set of residual data values. Said mathematical 
transformation is most typically a simple mathematical 
difference, however, any appropriate transformation may be 
used including transformations that remove correlated and 
uncorrelated noise from the residual data values. The residu- 
als produced and transformed in the compute data residuals 
procedure 76 are next processed by a perform ASP tests 
procedure 78 that uses the fault detection models 54 to 
produce a ASP fault indication that is a hallmark of the 
method and system of the instant invention. Next, the fault 
found decision procedure 68 is performed on the basis of the 
ASP fault indication results produced by the perform ASP 
tests procedure 78. The fault found decision procedure 68 
may have any one of a plurality of structures and procedures, 
including but not limited to methods and systems to perform 
false alarm filtering by means of observing a time series of 
ASP fault indication results for the purposes of determining 
the actual presence of a fault. In one preferred embodiment 
of the instant invention, a conditional probability fault found 
decision procedure 68 is used to perform said false alarm 
filtering. 

Continuing to refer to FIG. 6, the estimate process param- 
eters procedure 64 uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 
values wherein the parameter estimation method used may 
have any one of a plurality of structures and procedures, 
including but not limited to, a plurality of redundant sensor 
techniques, a plurality of multivariate state estimation 
techniques, a plurality of neural network techniques, a 
plurality of mathematical model techniques, a plurality of 
autoregressive moving average techniques, and a plurality of 
Kalman filter techniques. 

Referring to FIG. 7, one possible redundant sensor tech- 
nique for the estimate process parameters procedure 64 is 
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illustrated. The acquire current operating data procedure 62 
is used to acquire current signal data values from signals 14 
monitored from asset 12 via sensors 18. The estimated value 
for a first redundant sensor signal is computed using a 
5 mathematical transformation on the acquired value of a 
second redundant sensor signal. Said mathematical transfor- 
mation is the estimate process parameters procedure 64 that 
in this case may be a simple equivalence or may include 
biasing, de-noising or other signal processing. The compute 
10 data residuals procedure 76 is then performed followed by 
the perform ASP tests procedure 78 as described herein- 
above and further delineated hereinbelow. 

Referring to FIG. 8, one possible multivariable parameter 
estimation technique for the estimate process parameters 
15 procedure 64 is illustrated. The acquire current operating 
data procedure 62 is used to acquire current signal data 
values from signals 14 monitored from asset 12 via sensors 
18. The estimated value for one or more sensor signals is 
computed using a mathematical transformation on the 
20 acquired values of one or more sensor signals. Said math- 
ematical transformation is the estimate process parameters 
procedure 64 that in this case may implement any feasible 
parameter estimation technique or procedure, including but 
not limited to a plurality of multivariate state estimation 
25 techniques, a plurality of neural network techniques, a 
plurality of mathematical model techniques, and a plurality 
of Kalman filter techniques. The compute data residuals 
procedure 76 is then performed followed by the perform 
ASP tests procedure 78 as described hereinabove and further 
30 delineated hereinbelow. 

Referring again to FIG. 6, the usefulness of the instant 
invention is, inter alia, the improvement achieved in the 
accuracy of the fault decision procedure 68 that results from 
the improvement achieved in the accuracy of perform fault 
35 detection procedure 66 made possible by the novel perform 
ASP tests procedure 78 that is a hallmark of the instant 
invention. Improving the accuracy of the fault decision 
procedure 68 accomplishes a reduction in the number of 
false alarms sent to a process operator or control system that 
40 can in turn result in an erroneous alarm or control action by 
the alarm or control action procedure 70. Further, improving 
the accuracy of the fault decision procedure 68 accomplishes 
a reduction in the number of missed alarms thereby accom- 
plishing more timely alarm or control action by the alarm or 
45 control action procedure 70. The instant invention thereby 
enables improved operating safety, improved efficiency and 
performance, and reduced maintenance costs for a wide 
variety of industrial, utility, business, medical, 
transportation, financial, and biological processes and appa- 
50 ratuses wherein such process and/or apparatus asset 12 has 
at least one characteristic data signal suitable for surveil- 
lance. 

In use and operation, and in one preferred form, FIGS. 1 
and 9 outline a general surveillance procedure of the system 
55 10 when employing the novel fault detection model 54 
contained in the process model 50 and the accompanying 
novel fault detection procedure 66 having the perform ASP 
tests procedure 78. In a typical surveillance procedure, the 
asset 12 is the source of at least one process signal 14 that 
60 is acquired and digitized using conventional data acquisition 
means 20 for providing the data acquisition procedure for 
the purpose of computer- assisted surveillance. The digitized 
signal data is generally evaluated using a computer 22 
having computer software modules implementing the esti- 
65 mate process parameters procedure 64, and the perform fault 
detection procedure 66. The estimate process parameters 
procedure 64 is used to produce an estimated signal value for 
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at least one process signal 14 emanating from the asset 12. 
The estimate process parameters procedure 64 in general 
makes use of the process model 50 stored in a memory 
means 24 associated with the computer 22 to produce the 
estimated signal values. The estimated signal values are then 
generally evaluated using the perform fault detection pro- 
cedure 66 to identify faults or operating anomalies in the 
asset 12. The results of the fault detection evaluation are 
thereafter communicated by a conventional communications 
means 26 (as is known to those having ordinary skill in the 
art, and informed by the present disclosure) to an operator 
console 80 and/or automated process control system 82 for 
possible alarm and/or control action. 

The computer 22 along with the associated memory 
means 24 can also be employed to perform the training and 
surveillance procedures 30, 60 as delineated hereinabove 
and to produce and store all the data associated with these 
procedures, for example, the historical training data, 
designer defined settings, and the process model. 
Multivariate State Estimation Technique (MSET) for Esti- 
mate Process Parameters Procedure 

In one preferred embodiment of the instant invention, the 
estimate process parameters procedure 64 uses a multivari- 
ate state estimation technique (MSET) procedure having an 
MSET parameter estimation model 52 structure. The US 
Department of Energy’s Argonne National Laboratory origi- 
nally developed the implementation of MSET described 
herein for surveillance of sensors and components in nuclear 
power plant applications. However, other implementations 
of a multivariate state estimation technique are possible and 
useful in conjunction with the instant invention. MSET is in 
general a statistically derived parameter estimation algo- 
rithm that uses advanced pattern recognition techniques to 
measure the similarity or overlap between signals within a 
defined domain of asset operation (set of asset operating 
states). MSET “learns” patterns among the signals by 
numerical analysis of historical asset operating data. These 
learned patterns or relationships among the signals are then 
used to identify the learned state that most closely corre- 
sponds with a new signal data observation. By quantifying 
the relationship between the current and learned states, 
MSET estimates the current expected response of the asset 
signals. MSET then uses threshold comparisons or a form of 
statistical hypothesis testing, such as a Sequential Probabil- 
ity Ratio Test (SPRT) as disclosed in U.S. Pat. No. 5,459,675 
which is hereby incorporated by reference in its entirety or 
the Adaptive Sequential Probability (ASP) procedure that is 
a hallmark of the instant invention to compare the current 
estimated value of a signal with its observed value. The 
MSET procedure provides an accurate and widely appli- 
cable method to estimate the operating signal values for an 
asset. However, other implementations of the parameter 
estimation procedure are possible and useful in conjunction 
with the instant invention. 

An MSET model is created for the asset 12 using the 
MSET training algorithms to learn the inherent data rela- 
tionships within a set of historical asset operating data. The 
trained MSET model is then used with the MSET parameter 
estimation and fault detection algorithms to perform the 
process surveillance function when presented with a new 
observation of signal data values. The following sections 
will first provide a mathematical overview of the MSET 
algorithms and procedures useful for training a parameter 
estimation model and for using this trained model for 
process surveillance. The description is followed by a 
detailed description of a preferred embodiment of the instant 
invention using a novel Adaptive Sequential Probability 
(ASP) procedure for fault detection during process surveil- 
lance. 


MSET Training and Parameter Estimation Procedures 

The MSET methods are generally described in the fol- 
lowing two US Government documents produced and main- 
tained by the US Department of Energy’s Argonne National 
5 Laboratory, Argonne, 111., disclosure of which is incorpo- 
rated in its entirety herein by reference. 

J. P. Herzog, S. W. Wegerich, R. M. Singer, and K. C. 
Gross, “Theoretical Basis of the Multivariate State 
Estimation Technique (MSET),” Argonne National 
10 Laboratory, ANL-NT-49, December 1997. 

J. P. Herzog, S. W. Wegerich, K. C. Gross, and R. M. 
Singer, “MSET: Code Structure and Interface Devel- 
opment Guide,” ANL-NT-48, August 1997. 

The MSET algorithm uses pattern recognition with his- 
torical operating data from an asset to generate a parameter 
estimation model. If data is collected from a process over a 
range of operating states, this data can be arranged in matrix 
form, where each column vector (a total of m) in the matrix 
represents the measurements made at a particular state. 
Thus, this matrix will have the number of columns equal to 
20 the number of states at which observations were made and 
the number of rows equal to the number of measurements (a 
total of n signal data values) that were available at each 
observation. We begin by defining the set of measurements 
taken at a given time t as an observation vector X(ty), 

25 

X(tyH Xi(ty), X 2 (t y ), . . . , X n (tj)Y (El) 

where x/ty) is the measurement from signal i at time ty. We 
then define the data collection matrix as the process 
30 memory matrix D: 
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Each of the column vectors (X(ty)) in the process memory 
matrix represents an operating state of the asset. Any number 
40 of observation vectors can be assigned to the process 
memory matrix. Training an MSET model includes collect- 
ing enough unique observation vectors from historical 
operation of the asset during normal conditions such that the 
process memory matrix encompasses the full dynamic oper- 
ating range of the asset. Computation of the D matrix is the 
45 first of three steps in the method for training an MSET model 
based on historical operating data. 

One of at least two algorithms is used by MSET to select 
the vectors in the D matrix. The MinMax algorithm extracts 
vectors that bound the vector space defined by the training 
50 data and returns the smallest process memory matrix that 
will produce an effective system model (see also U.S. Pat. 
No. 5,764,509 which is hereby incorporated by reference in 
its entirety). A vector ordering algorithm selects and 
includes representative vectors from the inner regions of the 
vector space producing a more accurate system model. 

55 Once the process memory matrix has been constructed, 
MSET is used to model the dynamic behavior of the system. 
For each current observation of the system (Xobs), MSET 
compares the observation vector to the stored operating 
states to calculate an estimate of the process parameter 
60 values. The parameter estimate of the current process state 
(Xest) is an n-element vector that is given by the product of 
the process memory matrix and a weight vector, W: 


x es( =D'W 


(E3) 
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The weight vector represents a measure of similarity 
between the estimate of the current state and the process 
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memory matrix. To obtain the weight vector, we minimize 
the error vector, R, where: 

R=X obs -X est (E4) 

The error is minimized for a given state when: 

W=( D r © -i) _1 -( D© X obs ) (E5) 

This equation represents a “least squares” minimization 
when the pattern recognition operator © is the matrix dot 
product. Several advanced pattern recognition operators 
have been defined that provide excellent parameter estima- 
tion performance. Pattern recognition operators used by 
MSET include, but are not limited to, the System State 
Analyzer (SSA) method (see also U.S. Pat. No. 4,937,763 
which is hereby incorporated by reference in its entirety), the 
Bounded Angle Ratio Test (BART) method (see also U.S. 
Pat. No. 5,987,399 which is hereby incorporated by refer- 
ence in its entirety), the Vector Pattern Recognizer (VPR) 
method, the Vector Similarity Evaluation Technique (VSET) 
method, and the Probabilistic State Estimation Method 
(PSEM). 

Once the weight vector is found, the resulting current state 
estimate of the system (i.e., the parameter estimate vector) 
is given by; 

X„,= D (D© -irMD© X obs ) (E6) 

The first application of the pattern recognition operator in 
equation E6 (D r ©D) involves a comparison between the 
row vectors in the D r matrix and each of the column vectors 
in the D matrix. If we define G=D r ©D, then G, the 
similarity matrix, is an m by m matrix. The element in the 
i-th row and j-th column of the matrix (g t - •) represents a 
measure of the similarity between the i-th and j-th column 
vectors (i.e., memorized states) in the process memory 
matrix. The second application of the pattern recognition 
operator in equation E6 (D r © X ofas .) involves a comparison 
between the row vectors in the D r matrix and each of the 
elements in the observation vector Xobs. If we define 
A=D r ©X ofas ., then A, the similarity vector, is an m by 1 
vector. Each element in the similarity vector is a measure of 
the similarity between the observation vector and the i-th 
column vector (i.e., memorized state) in the process memory 
matrix. 

Note that the similarity matrix is a function of the process 
memory matrix only. Thus, the similarity matrix and its 
inverse Ginv=(D r ©D) _1 can be calculated as soon as the 
process memory matrix has been derived thereby making the 
application of MSET to an on-line surveillance system more 
computationally efficient. Computation of the Ginv matrix 
initializes the parameter estimation model and completes the 
second of three steps in the procedure for training an MSET 
model based on historical operating data. 

The third and final step in the MSET training procedure 
includes analyzing the historical training data using equation 
E4 to produce a residual error vector, R, for each observation 
vector in the training data. The collection of residual error 
vectors comprises the training data residuals necessary for 
training the fault detection model 54 using any one of a 
plurality of techniques, including but not limited to the 
SPRT technique, and the novel ASP technique that is a 
hallmark of the instant invention. 

The Sequential Probability Ratio Test (SPRT) technique is 
a statistical hypothesis test fault detection algorithm histori- 
cally used for MSET process surveillance. The SPRT tech- 
nique is described in U.S. Pat. No. 5,459,675, which is 
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incorporated herein by reference in its entirety. The SPRT 
analyzes a sequence of discrete residual error values from a 
signal to determine whether the sequence is consistent with 
normal signal behavior or with some other abnormal behav- 
5 ior. When the SPRT reaches a decision about the current 
signal behavior, e.g., that the signal is behaving normally or 
abnormally, the decision is reported and the test continues 
analyzing the signal data. For any SPRT, signal behavior is 
defined to be normal when the signal data adheres to a 
10 Gaussian probability density function (PDF) with mean 0 
and variance a 2 . Normal signal behavior is referred to as the 
null hypothesis, H 0 . MSET employs four specific SPRT 
hypothesis tests. Each test determines whether current signal 
behavior is consistent with the null hypothesis or one of four 
15 alternative hypotheses. The four tests are known as the 
positive mean test, the negative mean test, the nominal 
variance test, and the inverse variance test. For the positive 
mean test, the corresponding alternative hypothesis, H 1? is 
that the signal data adhere to a Gaussian PDF with mean +M 
20 and variance cr 2 . For the negative mean test, the correspond- 
ing alternative hypothesis, H 2 , is that the signal data adheres 
to a Gaussian PDF with mean -M and variance a 2 . For the 
nominal variance test, the corresponding alternative 
hypothesis, H 3 , is that the signal data adheres to a Gaussian 
25 PDF with mean 0 and variance Va 2 . For the inverse variance 
test, the corresponding alternative hypothesis, H 4 , is that the 
signal data adheres to a Gaussian PDF with mean 0 and 
variance c rVV. The user- assigned constants M and V control 
the sensitivity of the tests. 

30 Limitations of the SPRT Fault Detector Training and Sur- 
veillance Method and System 

One significant shortcoming of the SPRT technique is 
found in the assumptions underlying its mathematical for- 
mulation. Specifically, the SPRT technique presumes that 
35 the residual error signals adhere to a Gaussian probability 
density function. For residual error signals that are non- 
Gaussian, the fault detector false alarm rates and/or missed 
alarm rates specified by the designer are not accomplished 
by the SPRT procedure thereby degrading the fault decision 
40 accuracy of the asset control and/or surveillance system. The 
novel ASP technique of the instant invention specifically 
removes the assumption that the residual error signals 
adhere to a Gaussian probability density function. The ASP 
technique implements any one of a plurality of methods to 
45 numerically fit a probability density function to the observed 
residual error signal distribution that is characteristic of 
normal asset operation. The derived probability density 
function is then used to perform a dynamic statistical 
hypothesis test thereby achieving the fault detector false 
50 alarm and missed alarm rates specified by the designer and 
improving the fault decision accuracy of the asset control 
and/or surveillance system. 

Fault Detection Using Statistical Hypothesis Test Proce- 
dures 

55 The general theory underlying the statistical hypothesis 
test will now be delineated below. Next, the SPRT imple- 
mentation of a dynamic statistical hypothesis test will be 
described. Finally, the novel ASP implementation of a 
dynamic statistical hypothesis test for non-Gaussian residual 
60 error signals will be delineated in detail along with a 
delineation of its reduction to practice. 

B ayes’ Rule for a Single Observation 

Statistical decision problems in which there are just two 
possible outcomes constitute an important class called 
65 binary hypothesis testing problems. The possible states of a 
system are called hypotheses and each individual state of the 
system is termed a simple hypothesis. A simple hypothesis 
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is a complete specification of a probability distribution for 
the system (i.e., the distribution of possible observations or 
samples from the system). The “hypothesis” being tested is 
that the particular distribution is the correct one. 

The basic operation in a binary hypothesis test is to 5 
evaluate the veracity of a hypothesis, H, given a piece of 
evidence or observation, e, from the system being studied. 
Because of the unpredictability or element of chance inher- 
ent in the system, the test deals with the probabilities that 
events occur or that hypotheses are true. The probability that 
a hypothesis is true given a piece of evidence is written as 
P(H|e). The notation identifies a conditional probability — 
namely the probability that the hypothesis is true under the 
condition that the event has occurred with absolute certainty. 
Bayes’ well known inversion rule, as described in Probabi- 
listic Reasoning in Intelligent Systems: Networks of Plau- 15 
sible Inference, by Judea Pearl, Morgan Kaufmann 
Publishers, Inc., San Mateo, Calif., 1988, Second Edition at 
pages 29 through 39, provides a means for evaluating the 
conditional probability of a hypothesis, P(H|e), which is also 
known as the posterior probability, 20 


P{H\e) = 


P(e\H)P(H) 

P(e) 


(E7) 


P(e|H) is the probability that the observation would occur 
if the hypothesis is true. P(H) is the probability that the 
hypothesis is true before any observations of the system 
have been made, also known as the prior probability. The 
denominator, P(e), is the unconditional probability that the 
observation occurs. 

Basic axioms of set theory can be used to prove the 
following identity for two events R and S: R=(RPlS)U(Rn 
_,S), where --S is the converse of event S. In probability 
theory, the analog of this identity is 

P(R)=P(R, S)+P(R ->S), (E8) 


The complementary form of Bayes’ rule provides the 
posterior probability for the converse of the hypothesis. It is 
evaluated by substituting -«H for each instance of H in 
equation Ell and noting that -i(-«H)=H: 


A- H\e) = 


P{e h H) 
P(e\H)P(H) + P(e |-- H ) ' 


(El 2) 


An alternate form of Bayes’ rule is produced by dividing 
it (i.e., equation Ell) with its complementary form (i.e., 
equation E12) to obtain 

P(H\e) _ P(e\H ) P(H ) (El 3) 

P(-. H\e) ~ P(e h H ) P (- 1 H) ' 


This form of Bayes’ rule is further manipulated by first 
defining the prior odds on hypothesis H as 


0(H) = 


P(H ) 

P(-> ny 


(El 4) 


the likelihood ratio as 


lXe\H) = 


P(e\H) 
P(e h //)’ 


(El 5) 


and the posterior odds on H as 


0(H\e) = 


P(H\e) 
P(~> H\e)' 


(El 6) 


Bayes’ rule then specifies the posterior odds as the 
product of the likelihood ratio and the prior odds, 


where the notation P(R,S) is used represent the probabil- 
ity of the joint event RPlS. The multiplication law states 40 
that the probability of two events occurring jointly can 
be expressed as a function of the conditional probabil- 
ity of one event based on the other, 

P(R, s)=P(R\s)P(R). (E9) 45 

If R and S are independent events, then P(R|S)=P(R) and 
the multiplication law simplifies to P(R, S)=P(R) P(S). 

Bayes’ rule can be simplified by eliminating the denomi- 
nator in equation E7. Combining equations E8 and E9, and 
substituting e for R and H for S, the denominator can be 50 
written as 

P(e)=P(e\H)P(H)+P(e\->H)P(^H). (E10) 

Therefore, Bayes’ rule becomes 


0(H\e)=L(e\H)0(H). (E17) 

Baves’ Rule for a Time Series 

The formulation above specifies Bayes’ rule for a single 
observation and a binary hypothesis. For the application of 
binary hypothesis tests to real world signals, the formulation 
must be able to handle a sequence of discrete observations. 
This is accomplished by beginning with a single observation 
and successively updating Bayes’ rule for each successive 
observation. Let the sequence {Y n } be an ordered set of n 
elements, {Y n }=y 1? y 2 , . . . , y n , in which the elements are 
observations of the signal made at n discrete moments in 
time such that t 1 <t 2 < . . . <t n . Bayes’ rule for the first 
observation (y 2 ) in the time series is 

, _ P(yi\H)P(H) (El 8) 


P(H\e) = 


P(e\H)P(H) 

P(e\H)P(H) + P(e |-- H)P(-n H) 


(Ell) 


Adding the second observation from the time series, 
Bayes’ rule for the joint event y ± C\y^ is 


The power of Bayesian techniques comes primarily from 
the fact that in causal reasoning the relationship P(e|H) is 
local, namely, given that H is true, the probability of e can 
be estimated naturally and is not dependent on many other 
propositions. For instance, given that the measurements 
from an experiment adhere to a particular PDF, the prob- 
ability that any single measurement will occur is easily 
computed 


x P( yi ,y 2 \H)P(H) (El 9) 

W yi ,y 2) = . 

With the aid of the multiplication law (equation E9), the 
joint event probabilities are converted to conditional prob- 
abilities so that the right hand side of this equation can be 
rewritten as 


60 


65 
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^ x P{yi\yuH)P{ yi \H)P{H) (E20) 

P(H\yi,yi) = D , , ND , , ■ 

P(y2\yi)P(yO 


Note that the probability of the joint event y i riy 2 is 
written as P(y 1? y^P^JyJ P(yi) instead of the equivalent 
P(y 1? y 2 )=P(y 1? y 2 ) P(y 2 ) because of the temporal depen- 
dency of the data. The second form of the multiplication law 
reduces to P(y 1? y 2 )=P(yi) P(y 2 ) because earlier events (e.g., 
yj in a time series cannot be dependent on later events (e.g., 

y 2 )- 

The multiplication law is used for each successive obser- 
vation in the time series to derive the form of Bayes’ rule for 
the joint event (y i ny 2 Pl . . . Ply n ): 


TXH\y u yi, ... ,y n )= (E21) 

yi, »)••• PtoLvi, fmympm 

P{y n \y n - 1 , • • • , yi )• • • P(y2\yi )P(yi ) 


Since the probabilities for each of the later observations in 
the time series are conditioned on earlier observations, 
Bayes’ rule is difficult to solve in the general case. But if the 
observations in the series are independent of each other (i.e., 
random), then the probabilities will be dependent on the 
hypothesis only. In this case the conditional probability for 
the ith observation, P(yJy J _ 1 , . . . , y 1? H), is simply P(yjH). 
Thus, Bayes’ rule for independent observations in a time 
series is 

P(H\ yi ,y 2 ,... ,y n ) = (E 22 ) 

P{y ri \H)...P(y i \II)P{H) = A P{y,\H) 
Piyn)...P(y i) P(yO ' 


If an explicit time dependency can be established for the 
observations in the time series (i.e., a function is found that 
relates earlier events to later events), then the general form 
of Bayes’ rule (equation E21) can be used to develop failure 
detection models for serially-correlated signals. However, 
the residual signals formed by the difference between the 
observed and estimated signal values are in general random 
signals; thus, Bayes’ rule for random time series is used as 
the basis for the fault detection models. 

Dividing by the complementary form of Bayes’ rule for 
random time series and utilizing the definition of the pos- 
terior odds, prior odds, and likelihood ratio from above, an 
alternate form of Bayes’ rule for a time series is developed: 


0(H\y u y 2 , = 0(H)\\ Uyi\H). 

1 = 1 


(E23) 


If we take the logarithm of this equation, the incremental 
nature of Bayesian formulation becomes more apparent. 
Equation E24 shows the log of the likelihood ratio as a 
weight, carried by each observation in the sequence, which 
additively sways the belief in the hypothesis one way or the 
other. 


A (E24) 

HO(H\ yi , y 2 , ... , y n )] = ln[0(//)] +2] InU^il*)]- 

/= l 


Sequential Hypothesis Tests 

Wald first presented and studied the following sequential 
test of a simple hypothesis against a simple alternative, as 


20 

described in Sequential Analysis, by A. Wald, John Wiley 
and Sons, Inc., New York, 1947. Let H 0 be a specific 
probability density function called the null hypothesis. Then 
the probability that the time series {Y„} contains samples 
5 drawn from H 0 is P(y 1? y 2 , . . . , yjH 0 ). Let H [■ be a different 
probability density function called the alternative hypoth- 
esis. Then the probability that the time series {Y n } contains 
samples drawn from H ; is P(y 1? y 2 , . . . , y„|H-). Two 
threshold limits A and B are chosen, with A<B, and after 
10 each observation in the series the following statistic (Ay n ) is 
calculated: 


_ P(yi,y2, - ,y n \Hj) (E25) 

J * ~ P(yi,y2, ■■■ ,y n IW 

15 

The test procedure is then as follows. If the statistic is 
greater than or equal to the upper threshold limit (i.e., 
Ay^^B), then a decision is made to accept hypothesis Hy as 
20 true. If the statistic is less than or equal to the lower 
threshold limit (i.e., A jn ^A), then a decision is made to 
accept hypothesis H 0 as true. If the statistic falls between the 
two limits (i.e., A<Ay „<B), then neither hypothesis can yet 
be accepted to be true and sampling continues. If the 
25 observations in the series are independent random variables, 
then the test statistic reduces to a product of conditional 
probabilities: 


30 J * P(yi\H 0 y 


(E26) 


In the sequential hypothesis tests, the logarithm of the test 
statistic is often easier to work with: 


35 


In Ay^ 



P(yi\Hj) 

P(yi\H 0 ) 


(E27) 


40 The sequential hypothesis test consists of calculating the 
logarithm of the test statistic for each observation in the 
series and comparing the result to the logarithms of the 
lower and upper threshold limits. 

The statistic in the sequential hypothesis test is a product 
45 of a sequence of likelihood ratios. Each term in the product 
is the ratio of a probability conditioned on one hypothesis to 
a probability conditioned on a second hypothesis. The 
difference between the likelihood ratios in the sequential 
hypothesis tests and those in Bayes’ rule (see equations E15 
50 and E23), is that in Bayes’ rule the probabilities are condi- 
tioned on a hypothesis and its converse, whereas in the 
sequential hypothesis tests the probabilities are conditioned 
on two hypotheses from a set of related hypotheses. In 
principle, the two hypotheses in the sequential hypothesis 
55 tests could be the converse of each other (i.e., the set 
contains two elements), but in practice the hypotheses are 
selected from an infinite set of exhaustive and mutually 
exclusive hypotheses. For instance, suppose the null hypoth- 
esis is a Gaussian PDF with a mean of 1 and a variance 10. 
60 Then a sequential hypothesis test can be defined in which the 
alternate hypothesis is any Gaussian PDF in which the mean 
is not 1 and/or the variance is not 10 . 

It is informative to compare the sequential hypothesis 
tests to Bayes’ rule for a time series. Given a null hypothesis 
65 H 0 and an alternative hypothesis Hy, the likelihood ratio for 
an observation e conditioned on the two hypotheses is 
defined as 
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Lj(e\H 0 ) = 


P(e\Hj) 

P(e\H 0 y 


(E28) 


The subscripts emphasize the fact that the two hypotheses 
are selected from an infinite set of related hypotheses. The 
prior odds for the two hypotheses are defined as 


Oj(H 0 ) : 


P(Hj) 

P(H 0 y 


(E29) 


while the posterior odds are defined as 


Oj(H 0 \e) = 


P{Hj\e) 

P(H 0 \ey 


(E30) 


decision to accept hypothesis H 0 is erroneous is denoted by 
|3. A type II decision error is also called a missed alarm and 
the probability of a type II error is called the missed alarm 
probability. 

5 The sequential hypothesis tests are open-ended. The tests 
will continue to collect observations from the system and 
update the test statistic until the test statistic satisfies one of 
the two decision conditions. In principle, the number of 
observations needed to reach a decision can be any positive 

10 integer, although it can be shown that a decision will be 
reached in a finite number of observations. Since the number 
of observations needed to make a decision is indeterminate, 
the probability that a decision is erroneous is found by 
summing the probability of an erroneous decision being 

15 made after 1 observation, 2 observations, and so on. 
Formally, in terms of the threshold limits A and B that define 
the test, the false alarm probability is given by: 


Assuming the observations in the time series are 
independent, then Bayes’ rule conditioned on hypothesis Hy 
(i.e., equation E22 with the symbol H replaced by Hy) can be 
divided by Bayes’ rule conditioned on hypothesis H 0 to 
produce Bayes’ rule for a sequential hypothesis test: 

« (E31) 

Oj(H 0 \ yi ,y 2 , ... , y n ) = Oj(H 0 )Yl Lj( yi \H 0 ). 


a=P(A jl >B \H Q )+P(B<Aj' 2 <A, A ja ^B\H 0 )+ . . . (E33) 

20 

The first term in the sum is the probability that the test 
statistic drops below the lower threshold limit after only one 
observation given that the alternative hypothesis is true. The 
second term is the probability that the test statistic drops 
25 below the lower threshold limit after two observations given 
that the alternative hypothesis is true. Similarly, the missed 
alarm probability is given by: 


Dividing this equation through by the prior odds, it 
becomes apparent that the statistic in the sequential hypoth- 
esis test is just the ratio of the posterior odds to the prior 
odds: 


Oj(Ho\y lt y 2 , ■■■ , y n ) 
Oj(H 0 ) 


fl Lj(y,\H 0 ) m A j. 


(E32) 


The prior odds are the ratio of the probability that the 
alternative hypothesis is true to the probability that the null 
hypothesis is true, before any data have been collected from 
the system. In many cases, these probabilities are not 
known — no prior information about the system is known. In 
the absence of prior information about the system, these 
probabilities are taken to be V 2 , making the prior odds equal 
to 1. Thus in the absence of prior information, the test 
statistic Ay n is equal to the odds that the system adheres to 
the alternative PDF as opposed to the null PDF. 

Error Probabilities for the Sequential Hypothesis Tests 
Because the sequential hypothesis tests make decisions 
based on probabilities, there is always a finite probability 
that a decision reached by the test is erroneous. If a sequen- 
tial hypothesis test makes a decision to accept the alternative 
hypothesis Hy as true when the null hypothesis H 0 is true, 
then an error of type I is said to have occurred. If a sequential 
hypothesis test makes a decision to accept the null hypoth- 
esis H 0 as true when the alternative hypothesis Hy is true, 
then an error of type II is said to have occurred. Although the 
designation is arbitrary, it stems from situations in which one 
kind of error is more serious than the other. Since the normal 
usage of the sequential hypothesis tests is to detect a change 
in signal response from its normal behavior (i.e., hypothesis 
H 0 ) to some abnormal behavior (i.e., hypothesis Hy), the 
error of accepting H ; when H 0 is true is the more serious 
error. The probability that a decision to accept hypothesis Hy 
is erroneous is denoted by a. A type I decision error is also 
called a false alarm and the probability of a type I error is 
called the false alarm probability. The probability that a 


Ay , 1 \Hj)+P(B<Aj 2 <A, A ja <A\H^ . . . (E34) 

30 

These expressions are by no means easily computed. 
Moreover, one could not hope to solve these equations for A 
and B in terms of given a and (3, despite the desirability of 
being able to do so in setting up a test to provide a specified 
35 protection. Although these equations cannot be solved, it can 
be shown that the error probabilities and the threshold limits 
are related by the following inequalities 

a 1 B (E35) 

an A ^ — — — and — s — -. 


The error probabilities and the threshold limits are related 
by inequalities because the test statistic A } - n does not usually 
45 attain exactly the value A or the value B when the test is 
completed. But since a decision is declared as soon as an 
observation drives the test statistic past either threshold, the 
inequalities are almost equalities. Indeed, in practice, A and 
B are taken to be equal to a/(l-|3) and (l-a)/|3, respectively. 
50 Doing so, of course, means that the sequential hypothesis 
test actually carried out has error probabilities that are 
somewhat different than those specified. Let a' and (3' denote 
the empirical error probabilities actually attained by a test 
using specified threshold limits of A=a/(1-|3) and B=(l-a)/ 
55 p. Then according to the inequalities in equation E35, the 
empirical error probabilities (i.e., a' and |3') are related to the 
pre assigned error probabilities (i.e., the values of a and (3 
used to specify the threshold limits) by 

60 or' a 1 /? ( E36 ) 

< A = and < — = . 

(1-0) (1-0 U-aO B (1 -or) 


Multiplying these through to eliminate denominators, one 
65 obtains 


a’a'p^a-ap’ and p’-ap’^p-a’p. 


(E37) 
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Adding these two equations together, one obtains an 
inequality relating the empirical error probabilities to the 
pre assigned error probabilities: 


N(y, n, a 2 ) -- 


T-yfUr 


exp 




(E40) 


a’+P’^a+p. (E38) 

One of the key features of the hypothesis test technique is 
that the designer can specify the error probabilities. This is 
particularly important for type I errors, because false alarms 
can cause an operator to make an incorrect decision. Type II 
errors typically do not lead to incorrect decisions. This is 
because in the event that a real failure does occur, missed 
alarms may delay the time to detect the failure but not the 
ability to detect the failure. The result above (equation E38) 
shows that the preassigned false alarm probability is not a 
strict upper limit for the empirical false alarm probability. 
Similarly, the preassigned missed alarm probability is not a 
strict upper limit for the empirical missed alarm probability, 
even when the hypothesis test is applied to purely random 
data. It is the sum of the preassigned error probabilities that 
is an upper limit for the sum of the empirical error prob- 
abilities. Thus it is possible with purely random data for one 
of the empirical error probabilities to exceed its correspond- 
ing preassigned error probability, but both empirical error 
probabilities cannot be greater than their corresponding 
pre assigned error probabilities. 

True upper bounds for the empirical error probabilities 
can be determined from the inequalities in equation E36, 


a' <, 


a' 

(l-jfr) 


a 


and ft <, 


P 

(1 — or') 


fi 


(E39) 


For small preassigned error probabilities, the true upper 
bounds are only slightly greater than the pre assigned error 
probabilities. For instance, if both of the preassigned error 
probabilities are 0.01 then the empirical error probabilities 
of the test will not exceed 0.0101. 

The Sequential Probability Ratio Test (SPRT) 

The sequential hypothesis tests described herein above are 
general statistical tests valid for any pair of related hypoth- 
eses. MSET has historically employed four specific sequen- 
tial hypothesis tests to detect signal faults. These four tests 
are called the Sequential Probability Ratio Tests, or SPRTs. 
The SPRTs monitor for changes in the statistical character- 
istics of the residual signals. A residual signal is the differ- 
ence between an actual signal and MSET’s estimate of that 
signal. The SPRTs continually monitor a residual signal, 
generating sequences of decisions. A decision in which the 
null hypothesis is accepted (i.e., Ay n =A) is called a normal 
decision and implies that the residual signal is behaving as 
anticipated. A decision in which the alternative hypothesis is 
accepted (i.e., A y> ^B) is called a fault decision and implies 
that the residual signal is behaving abnormally. 

The null hypothesis upon which the SPRTs are based 
specifies that the residual signal consists of Gaussian data 
that have a sample mean of 0 and a sample variance of a 2 . 
A training procedure, during which the system is operating 
normally, is used to verify that the mean of the signal is 0 and 
to evaluate the variance of the signal. Note that if the 
residual signal does not have a mean of 0, the calculated 
mean from the training phase is used to normalize the 
residual signal for the surveillance procedure using the 
model. Thus the null hypothesis, H 0 , for the SPRTs is that 
the signal being analyzed adheres to Gaussian PDF (N(y; 

a 2 )). 


5 

for which the mean (ju) is 0 and the variance is cr 2 . Thus, the 
conditional probability that a discrete observation (y 2 ) 
occurs given the null hypothesis is expressed as 

P(y i\H 0 )=N(yi, 0, a 2 ). (E41) 

10 

The four SPRT tests historically used with MSET are the 
positive mean test, the negative mean test, the nominal 
variance test, and the inverse variance test. For the positive 
mean test, the corresponding alternative hypothesis, H 1 , is 
15 that the signal data adhere to a Gaussian PDF with mean +M 
and variance cr 2 , where M is the preassigned system distur- 
bance magnitude for the mean test. For the negative mean 
test, the corresponding alternative hypothesis, H 2 , is that the 
signal data adheres to a Gaussian PDF with mean -M and 
20 variance cr 2 . Referring to FIG. 10, the hypotheses for the two 
mean tests are illustrated for M=2 and cr 2 =l. For the nominal 
variance test, the alternative hypothesis, H 3 , is that the signal 
data adheres to a Gaussian PDF with mean 0 and variance 
Vcr 2 , where V is the pre assigned system disturbance mag- 
25 nitude for the variance test. For the inverse variance test, the 
corresponding alternative hypothesis, H 4 , is that the signal 
data adheres to a Gaussian PDF with mean 0 and variance 
cr 2 /V. Referring to FIG. 11, the hypotheses for the two 
variance tests are illustrated for V=2 and a 2 =l. The condi- 
30 tional probabilities that a discrete observation (y 1 ) occurs 
given one of the four alternative hypotheses are expressed as 

T\y t \H0 = N(yr, M, a 2 ), ( E42 ) 

35 Piy i \H 2 ) = N(y i --M,o 2 ), 

Piyi | H 3 ) = N(yr, 0, Vcr 2 ), 

FXy, | H 4 ) = A/fy,; 0, yl 
40 


The logarithm of the test statistic for the four SPRT tests 
can be evaluated by substituting the conditional probabilities 
from equations E41 and E42 into the general formula 
(equation E27) and simplifying. Thus, for the positive mean 


50 


55 
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In Ai,„ 


Z ^\N(yr,M, o 2 ) 
n [ N(yr, 0, cr 2 ) 



i„ 


CXp (-2^ fa - M)2 ) 

A-M 


Z yf - (yj - M ) 2 
2(r2 



SPRT^. 


(E43) 


The equation E43 defines the SPRT index for the positive 
mean test (SPRT^). A SPRT index is defined for each of the 
four sequential hypothesis tests. The SPRT indices are the 
65 actual quantities computed by the SPRT fault detection 
procedure. The SPRT index for the negative mean test 
(SPRT„^) is given by 
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n 

In A2,„ = ^ 1 


N(yi\ -M, cr 2 ) 
N(yi; 0, cr 2 ) 


(E44) 


M A/ M \ 


The SPRT index for the nominal variance test (SPRT„ om ) 
is given by 


\N(yi\ 0, Vo 2 )] 


toA3 -" = 20 n teo; 


O- 2 ) 


■2v 


t exp (-2^) 


V InNcr 2 


1 / )T 

= ex P 


/W ^ 


■zbu^-wi 


V — 1 V~1 ? 1 

: « r > v? - -InV ■ SPRTwm- 

2V(t 2 2 


(E45) 


The SPRT index for the inverse variance test (SPRT f „ v ) is 
given by 


the SPRT technique to non-Gaussian PDFs. In the ASP 
method, the assumption that the data fit a Gaussian PDF is 
relaxed and the test statistic is evaluated for any arbitrary 
data distribution. In the ASP method, the signal is assumed 
5 to consist of random observations that adhere to a specific 
PDF that is a function of the sample mean, variance, and 
possibly higher order terms. The PDF is denoted by the 
general function 3f(y; //, o 2 , . . . ). The parameter list of the 
10 function is open-ended to indicate that additional terms, 
such as the sample skewness, kurtosis, or width of the 
distribution at half-maximum, may be required to charac- 
terize the function. 

The null hypothesis upon which the ASP tests are based 
specifies that the data distribution has a sample mean of 0 
and a sample variance of a 2 . A training phase, during which 
the system is operating normally, is used to verify that the 
mean of the signal is 0 and to evaluate the sample variance. 
20 If the PDF is dependent upon any other additional terms, 
they are also evaluated during the training phase for numeri- 
cally tuning one or more probability functions obtained by 
fitting an open-ended general probability functions to one or 
more data distributions obtained during typical or normal 
25 operating conditions of the asset under surveillance. Thus, 
the conditional probability that a discrete observation (y a ) 
occurs given the null hypothesis is expressed as 


liuVv, = 


2> 


( cr 2 

-I 


N(yi \ 0, cr 2 ) 


(E46) 


(1 — V) * 1 

-2j>? + 2 laVmSPRT ™ 


2 cr 2 


Novel Improvements to the Fault Detector Training and 
Surveillance Procedures 

Having described the SPRT fault detector training and 
surveillance methods herein above, the section below 
describes the novel improvements made by the instant 
invention when used for fault detector training and 
surveillance, said improvements being applicable to any 
asset preferably having at least one characteristic data signal. 
The Adaptive Sequential Probability (ASP) Test 

The SPRT technique is valid only for residual signals that 
are random and adhere to Gaussian distributions. Because 
the MSET parameter estimation algorithm simulates the 
deterministic components of a set of inter-related raw 
signals, the residual signals that are produced within MSET 
are often white -noise processes (i.e., not serially -correlated). 
Although the MSET residual signals are typically white, 
they are often non-Gaussian (not normally distributed). If 
the SPRT fault detection model is applied to signals that are 
either non-Gaussian, non- white, or both, the theoretical false 
and missed alarm probabilities may not be achieved. The 
novel advanced fault detection method of the instant 
invention, herein named the Adaptive Sequential Probability 
(ASP) technique or method, solves the problem associated 
with non-Gaussian, non-white residual signals. The ASP 
method uses binary hypothesis tests that are numerically 
tuned to better accommodate the non-Gaussian data distri- 
butions that are typical for residual error signals produced by 
parameter estimation based techniques, such as MSET. 
Mathematical Foundations of the ASP Test 

The Adaptive Sequential Probability (ASP) method 
defines four new sequential hypothesis tests. The Adaptive 
Sequential Probability method is an advanced failure detec- 
tion technique that broadens the domain of applicability of 


30 i 5 (Vi|^o)=^0i; 0, a 2 , . . . ). (E47) 

Typically, four sequential hypothesis tests (i.e., a positive 
mean test, a negative mean test, a nominal variance test, and 
a inverse variance test) are utilized by the ASP method. For 
35 the positive mean test, the corresponding alternative 
hypothesis, H 1 , is that the signal data adhere to the specified 
PDF with mean +M and variance a 2 where M is a preas- 
signed system disturbance magnitude for the mean test. For 
the negative mean test, the corresponding alternative 
hypothesis, H 2 , is that the signal data adheres to the specified 
PDF with mean -M and variance a 2 . For the nominal 
variance test, the corresponding alternative hypothesis, H 3 , 
is that the signal data adhere to the specified PDF with mean 
45 0 and variance Va 2 where V is a pre assigned system 
disturbance magnitude for the variance test. For the inverse 
variance test, the corresponding alternative hypothesis, H 4 , 
is that the signal data adhere to the specified PDF with mean 
0 and variance c^/V. The conditional probabilities that a 
50 discrete observation (y 2 ) occurs given one of the four 
alternative hypotheses are expressed as 

P[yi | H { ) = 3 O',-; M, o 2 , . . .), ( E48 ) 

55 Pi yi \H 2 ) = 3(y r ,-M,o- 2 , ...), 

p&i I H 3 ) = 3 O';; 0, vy,...), 

Piyi | Ha) = 3 [yr, 0, y , ...). 
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The logarithm of the test statistic for the four ASP tests 
can be evaluated by substituting the conditional probabilities 
from equations E47 and E48 into the general formula 
65 (equation E27). The logarithm of the test statistic for a given 
test is defined to be the ASP index for that test. The ASP 
index for the positive mean test (ASP^) is given by 
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where the standard normal PDF 

Z{X): 


1 X 2 ' 

, = vSr exp N 


(E54) 


The ASP index for the negative mean test (ASP neg ) is 
given by 


In A 2 ,n 
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is a function of the dimensionless variable x=(y The 
standard normal PDF is related to the general Gaussian 
PDF (equation E40) through the normalization condi- 
tion: 


■ ASP ne 


£ r*oo 

Z{x)dx = J N(y; fi, o 2 )dy = 1. 


(E55) 


The ASP index for the nominal variance test (ASP nom ) is 15 
given by 


. . Vi 3 O'/; 
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The ASP index for the inverse variance test (ASP tnv ) is 
given by 


The remainder term can be expressed as an infinite series 
of orthogonal polynomials whose terms are functions of 
derivatives of the standard normal PDF. The constant coef- 
ficients of the polynomial terms are dependent on the central 
moments of the target distribution. As described in Hand- 
book of Mathematical Functions, by M. Abramowitz and I. 
A. Stegun, Dover Publications, Inc., New York, 1972, the 
first four terms in the series expansion of the remainder 
function are: 


In A 4> „ = Z kl 
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The ASP tests are then implemented in the same manner 
as the SPRT tests. Namely, for each time step in the 
calculation, the four ASP indices are calculated (equations 
E49 through E52). Each ASP index is compared to the upper 
and lower log-thresholds and the status of the test is evalu- 
ated (i.e., faulted, normal, or continuing). 

ASP Method for Near-Gaussian Distributions 

The Adaptive Sequential Probability method consists of 
four specific sequential hypothesis tests applied to non- 
Gaussian data distributions. In order to use the method, a 
general PDF, ^(y; //, a 2 , . . . ), must be defined for the target 
signal. In this section, the ASP method is derived for data 
distributions that are nearly Gaussian. 

In applications of MSET to the Space Shuttle Main 
Engine accelerometer signals, applicants discovered that the 
residual signals produced by the system model have a nearly, 
but not tmly, Gaussian behavior, as described in Dynamics 
Sensor Data Validation Phase I Final Report, by applicants: 
Randall L. Bickford and James P. Herzog, NASA Contract 
NAS8-40874, Jul. 1, 1997. When plotted as a histogram, the 
residual data will appear to have the same bell-curve shape 
as a Gaussian distribution. But when a Gaussian PDF of the 
same mean, standard deviation, and area as the data distri- 
bution is superimposed on it, the histogram was found by 
applicants to be non-Gaussian. Typically, applicants found 
that the histogram has thicker tails than the Gaussian curve, 
which corresponds to a sample kurtosis that is greater than 
3. 

A PDF can be written as a sum of a standard normal 
distribution, Z(x), and a remainder term, R(x), as described 
in Mathematical Methods of Statistics, by H. Cramer, Prin- 
ceton University Press, Princeton, N.J., 1946. 65 


- is £ - io S) 2 + 30 )* 61{x) + (Sk 8,w + 


The fi t factors are central moments of the discrete data 
sequence {Y„}=(y 1 , y 2 , , y„), 


' = {y j ~ & with ^ = y j ^ 
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The Z ( ' n \x) functions are derivatives of the standard 
normal PDF. The nth derivative of the standard normal PDF 
is given by: 


55 


&\x) - = (-1 ) n He n (x)Z(x), 

dxf 1 


(E58) 
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where He„(x) are the Hermite polynomials. The first twelve 
Hermite polynomials are 

He 1 (x)=x, 

He 2 (x)=x 2 -1, 

He 3 (x)=x 3 -3x, 

He 4 (x)=x 4 - 6x 2 +3, 


%(x)=Z(x)+R(x), 


(E53) 


He 5 (x)=x 5 - 1 0x 3 + 1 5x, 
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He 6 (x)=x 6 - 15xr’+45x 2 -15, 

He 7 (x)=x 7 -21x 5 +105x 3 -lQ5x, 

He s (x)=x 8 -28x 6 +210x 4 -420x 2 +105, 

He 9 (x)= jr J -36x 7 +378x 5 -1260x 3 +945x, 
He lo (x)-x lo -45x 8 +630x 6 -3150x 4 +4725x 2 -945, 
He 11 (x)=x 11 -55x 9 +990x 7 -693Qx 5 +17325x 3 -lQ395x, 
He 12 (x)=x 12 -66x lo +1485x 8 -13860x 6 +51975x 4 -62370x 2 +1039{£59) 

Each term in the series expansion of the remainder 
function R(x) is more complex than the previous term as 
higher central moments of the data distribution and higher 
derivatives of the standard normal PDF are introduced. The 
nth term in the remainder function depends on the central 
moments from order 1 (i.e., mean) through order n+2. Thus, 
the first term depends on moments through the third central 
moment (i.e., skewness), the second term depends on 
moments through the fourth central moment (i.e., kurtosis), 
and so on. 

If the data distribution that is to be approximated with 
equation E53 is nearly Gaussian, a good approximation will 
be achieved with only a few of the terms in the series. In the 
ASP method for near- Gaussian distributions, the data dis- 
tribution is first approximated with the standard normal PDF, 
Z(x). The approximation is refined by adding successively 
higher terms of the remainder function until an adequate fit 
between the general PDF, 3f(x), and the data distribution is 
produced. In practice it is usually not advisable to go beyond 
the fourth term in the remainder function because the tail 
regions of the general PDF become unstable. 

Using the equations for the derivatives of the standard 
normal PDF (equations E59 and E59), the remainder func- 
tion can be written as a product of the standard normal PDF 
and a polynomial whose coefficients depend on the central 
moments of the data distribution. 


R(x) = Z(x)J^n(x), 


(E60) 


where the first four terms in the series are 
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(E64) 


Thus, the series expansion formula for the approximation 
of the general PDF is 


n 

3 (x ) » z(x) + z(x)^ n(x). 


(E65) 


Application of the ASP Method to Space Shuttle Main 
Engine Residual Signals 

In one application use and operation, and in one preferred 
embodiment, the ASP technique was reduced to practice to 
io provide highly reliable data validation for high frequency 
Space Shuttle Main Engine (SSME) accelerometers. Each 
SSME is monitored by eight high frequency (10.24 kHz) 
accelerometers with three sensors mounted on each of the 
two high pressure turbopumps and two sensors mounted on 
5 the engine gimbal bearing. Several MSET models were 
prepared to validate the ASP method for the six turbopump 
accelerometers. The MSET pattern recognition algorithm 
produces extremely accurate simulations of turbopump 
accelerometer behavior. Referring to FIG. 12, some results 
from applying the six sensor MSET model to a typical Space 
20 Shuttle flight (i.e., flight STS057, engine 1) are shown. For 
the calculation shown in FIG. 12, the MSET training algo- 
rithm extracted a small fraction of the data in the acceler- 
ometer signals to simulate the behavior of the accelerom- 
eters (the process memory matrix contained only 0.0083% 
of the raw signal data). In the top plot, the highly dynamic 
signal from the first SSME turbopump accelerometer in the 
model (i.e., the oxidizer preburner pump 45 accelerometer) 
is shown as a function of time from the launch of the Space 

Shuttle. In the middle plot, MSET’s estimate of the accel- 
30 . r 

erometer signal shows the very high accuracy of the MSET 
simulation. The residual signal for the calculation, which is 
the difference between the sensor signal and MSET’s esti- 
mate of that signal, is shown in the bottom plot. The relative 
error in the calculation, as measured by the ratio of the 
standard deviation of the residual signal to the standard 
deviation of the sensor signal, is only 0.86%. The results 
shown in FIG. 12 illustrate MSETs ability to accurately 
model highly dynamic signals, even when only a tiny 
4Q fraction of available data is used to represent the signal 
behavior. 

Still referring to FIG. 12, the residual signal for the 
analysis of a typical SSME accelerometer appears to be 
random. Referring to FIG. 13, the distribution of data in the 
residual signal is shown. The probability density for the 
residual signal was approximated by dividing the data range 
into m equally-sized bins and then counting the number of 
residual signal data values that fall into each bin. The count 
number is a discrete function of the sequence {Y m }=y 1? 
50 Y 2 ? • • • ? y where the ith element in the sequence (y 1 ) is 
given by 


y 1 = ^min 


. Umax ^min) 


(E66) 


55 

and y min is the minimum datum and y max is the maximum 
datum in the residual signal. A normalized residual signal 
PDF is produced by dividing each element in the count 
number function by the total number of elements in the 
60 residual signal. The number of bins used to produce the 
residual signal PDFs was 1000. As shown in FIG. 13, a PDF 
for a typical residual signal has a nearly Gaussian shape. 
Superimposed on top of residual signal PDF is a Gaussian 
PDF of the same mean, standard deviation, and area as the 
65 residual signal PDF. The first four moments of the residual 
signal data are as follows: mean=l.l*10 -4 g, standard 
deviation=8.2*10 -2 g, skewness=0.32, and kurtosis=5.7, 
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where the skewness and kurtosis are related to the third and 
fourth central moments of the data by 


JU3 jJ.4 

skewness = — and kurtosis = — . 

cr 3 cn 


(E67) 


Because the kurtosis of the distribution is greater than 3, 
the residual signal PDF has thicker tail regions than a true 
Gaussian distribution. This is confirmed by a visual exami- 
nation of FIG. 13 . 

The ASP method for near- Gaussian distributions was 
applied to the residual signals from the MSET model of the 
SSME accelerometers. The residual signal PDFs were 
approximated with the series expansion formula of a general 
PDF (equations E60 through E65). Four calculations were 
performed for each residual signal in the model. In the first 
calculation, the residual signals were approximated with the 
one-term series expansion formula. In the one-term formula, 
the remainder function (R(x)) is given by the product of the 
standard normal PDF and the first term in the series, r 1 (x). 
Subsequent calculations introduced additional terms from 
the series, culminating with the four-term formula in which 
the remainder function is given by the product of the 
standard normal PDF and the four term series, r 1 (x)+r 2 (x)+ 
r 3 (x)+r 4 (x). 

For each calculation, the quality of the approximation is 
measured by the root mean-squared (rms) error of the 
calculation. The rms error (E^) is a function of the differ- 
ence between the calculated PDF (£f(x)) from equation E65 
and the residual signal PDF (F(x)), 


Erms 


1 M 


(E68) 


The calculated and residual signal PDFs are discrete 
functions of the dimensionless variable x I =(y 1 - J w)/a, where 
y 2 is an element in the sequence that spans the range of data 
in the residual signal distribution (see equation E66). 

In general terms, each additional term in the series expan- 
sion improves the approximation of the residual signal 
PDFs. Referring to FIG. 14 , FIG. 15 and FIG. 16 , adding 
terms from the series expansion formula improves the fit of 
the residual signal PDF from the first sensor. The approxi- 
mation of the residual signal PDF generated by the one-term 
series expansion formula is compared to a Gaussian approxi- 
mation and the residual signal PDF in FIG. 14 . The one-term 
approximation shows little improvement over the Gaussian 
approximation. The rms error for the Gaussian PDF is 0.414, 
while the rms error for the one-term series expansion PDF 
is slightly smaller at 0.409. The PDF generated by the 
two-term series expansion formula is shown in FIG. 15 . The 
fit of the two-tern approximation is significantly better than 
that of the Gaussian approximation. The two- term formula 
provides an excellent fit of the residual signal PDF, espe- 
cially near the peak in the distribution. The greatest source 
of error in the approximation occurs in the transition regions 
of the distribution, between 1 and 4 standard deviations from 
the mean. The rms error for the two- term series expansion 
PDF is 0.122, which is nearly a factor of four smaller than 
the rms error of the Gaussian PDF. The PDF generated by 
the three-term series expansion formula is shown in FIG. 16 . 
The three -term formula provides a better fit of the peak in 
the distribution than does the two -term formula. The rms 
error for the three-term approximation is 0.151, which is 
slightly larger than that of the two -term approximation 
because its fit of the lower transition region is less accurate 
than that of the two- term approximation. 


The rms errors of the four calculations for each of the 
sensors in the model are listed in FIG. 17 . Also included in 
FIG. 17 are the rms errors for Gaussian PDFs of the same 
mean, standard deviation, and area as the residual signal 
5 PDFs. As indicated by the data in the first two columns of 
FIG. 17 , the one-term series expansion formula produces 
PDFs that are slightly more accurate than the corresponding 
Gaussian PDFs. Significant improvement is exhibited by the 
two-term series expansion formula, which produces PDFs 
10 whose rms errors are as much as a factor of four smaller than 
those from the corresponding Gaussian and one-term series 
expansion approximations. The three -term series expansion 
formula generally produces larger rms errors than does the 
two -term formula because the fit of the transitions regions of 
15 the distribution is less accurate. This trend is further exhib- 
ited by the four-term series expansion formula. For three of 
the calculations (i.e., sensor numbers 1, 2, and 6), the 
four-term approximations are unstable in the transition 
regions, resulting in rms errors that are much larger than 
20 those of the corresponding Gaussian approximations. The 
four-term series expansion formula produces an improved 
approximation for the residual signal from sensor number 5 
only. The residual signal for this accelerometer is the most 
nearly Gaussian of the sensors to begin with, as indicated by 
25 the rms errors for the Gaussian approximations for all six 
sensors. These results suggest that the higher order terms in 
the series expansion should be used with caution: only those 
PDFs that are nearly Gaussian to begin with should be 
approximated with terms higher than the second term in the 
30 series expansion. 

In the ASP method for near-Gaussian distributions, the 
central moments of the distribution to be approximated are 
used to evaluate the coefficients in the series expansion. A 
second approach, known as the optimized ASP method for 
35 near-Gaussian distributions, was also evaluated. In the sec- 
ond approach, the higher-order central moments (i.e., ju 3 , ju 4 , 
ju 5 , and ju 6 ) are treated as free parameters in the model and 
the approximation is optimized by searching for values of 
the central moments that provide the best fit of the distri- 
40 bution. In this approach, the simplex optimization method is 
used to minimize the rms error of the calculation, as 
described in A Simplex Method for Function Minimization, 
by J. A. Nelder and R. Mead, Computer Journal, Vol. 7, 
1965, at pages 308 through 313. The simplex method is 
45 particularly useful for minimizing complicated transcenden- 
tal functions of two or more variables because it minimizes 
a function by utilizing evaluations of the function only — no 
evaluations of function derivatives are required. 

In the optimized ASP method for near-Gaussian 
50 distributions, the first step is to select the approximation 
formula to be optimized, either the one, two, three, or 
four-term series expansion formula. The number of free 
parameters in the model is determined by the approximation 
formula used. For instance, the one-term formula requires 
55 only one free parameter (i.e., ju 3 ), whereas the four- term 
formula requires four free parameters (i.e., fo, ju 4 , / 1 5 , and 
ju 6 ). The function that is minimized by the simplex algorithm 
is the rms error for the approximation, given by equation 
E68. The higher-order central moments of the distribution 
60 are used as initial values of the free parameters in the model. 
The simplex algorithm iterates on the free parameters until 
a minimum in the function is found, thereby producing the 
best fit (i.e., smallest rms error) between the calculated PDF 
and the residual signal PDF, for a given approximation 
65 formula 

The rms errors of the calculations using the optimized 
ASP method for near-Gaussian distributions are listed in 
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FIG. 18 . Also included in FIG. 18 are the rms errors for 
Gaussian PDFs of the same mean, standard deviation, and 
area as the residual signal PDFs. Comparing the rms errors 
of the Gaussian approximations to those from the optimized 
calculations shows that all optimized series expansion for- 
mulas produce significant reductions in the rms error. Com- 
paring the data in FIG. 18 to those of FIG. 17 shows that the 
optimized ASP method produces more accurate approxima- 
tions than does the ASP method for each series expansion 
formula. For a given sensor, the optimized two-term series 
expansion formula produces rms errors that are roughly a 
factor of two smaller than those of the optimized one- term 
formula. The optimized three-term and four-term series 
expansion formulas though, produce rms errors that are 
roughly equivalent to those of the optimized two-term 
formula. Since the three-term and four-term models incor- 
porate more complicated functions with more free param- 
eters than the two-term model, these results indicate that the 
optimized two-term series expansion method provides the 
best balance between accuracy and efficiency. 

The PDF generated by the optimized two -term series 
expansion formula for the first sensor is shown in FIG. 19 . 
The optimized two-term formula provides an excellent fit of 
the residual signal PDF across all regions of the distribution, 
including the transition regions. Comparing the results of the 
optimized ASP calculation in FIG. 19 to those from corre- 
sponding ASP calculations in FIG. 15 and FIG. 16 , reveals 
that the main effect of the optimization is to improve the 
accuracy of the approximation through the transition regions 
of the distribution. 

The optimized ASP method for near-Gaussian distribu- 
tions produces an excellent fit of the residual signal distri- 
butions from the MSET model of SSME accelerometer 
signals. Because the optimized ASP method is tuned to the 
specific data distributions analyzed, the empirical error 
probabilities for the method will better adhere to the theo- 
retical limits for the fault detection procedure. 

A parametric study of the empirical false alarm rate was 
performed to compare the false alarm of the SPRT method 
to the new ASP method for the SSME residual signals. In the 
study, the residual signals generated by applying the six 
sensor MSET model to the accelerometer signals from flight 
STS057 engine 1 were analyzed with the SPRT and ASP 
sequential hypothesis tests at various values of the false 
alarm probability and the system disturbance magnitude 
parameters. Because of the high frequency of the acceler- 
ometer signals (10.24 kHz) and long duration of the flight 
(542.4 sec), the residual signals contain over five million 
data values. The residual signal data set was so large that 
every tenth point was extracted from the signals and the fault 
detection models were applied to the subset. The subset of 
the residual signals was large enough to produce accurate 
statistics but small enough to permit multiple calculations in 
a reasonable time span. Typically, the sequential hypothesis 
tests make a decision for every 1 to 10 data points analyzed. 
Thus for the parametric calculations of the SSME residual 
signals, hundreds of thousands of decisions were made for 
each signal. By dividing the number of fault decisions from 
a SPRT analysis of a residual signal into the total number of 
fault and normal decisions made by the test, the empirical 
false alarm rate could be accurately evaluated. 

Referring to FIG. 20 , the results from the application of 
the positive mean SPRT and ASP tests to the residual signal 
from the first sensor are shown. The calculations shown in 
FIG. 20 were performed with a constant preassigned missed 
alarm probability ( (3) of 0 . 01 , a constant system disturbance 
magnitude (M) of 6 , and a variable preassigned false alarm 
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probability (a). The solid diagonal line in the figure repre- 
sents the theoretical upper bound for the empirical false 
alarm probability, as defined by equation E39. For the SPRT 
calculation, the empirical false alarm rate satisfies the theo- 
5 retical upper bound only for preassigned false alarm prob- 
abilities greater than 0.003. For smaller values of the pre- 
assigned false alarm probability, the SPRT model reaches 
more fault decisions than one would expect based on theo- 
retical arguments. The theoretical upper bound for the 
io empirical false alarm probability is not met for all values of 
the pre assigned false alarm probability because the residual 
signals are not purely Gaussian, as confirmed in FIG. 13 . 
The residual signal distribution has more data populating the 
tail regions than does a Gaussian distribution of the same 
15 mean, standard deviation, and area as the residual signal 
distribution. The data from the tail regions of a distribution 
are a major source of false alarms in the SPRT calculations, 
because they are more representative of the alternative 
hypothesis than they are of the normal hypothesis. Because 
20 the residual signals exhibit more heavily populated tail 
regions than true Gaussian distributions, they have a ten- 
dency to trigger more false alarms than anticipated, espe- 
cially at high levels of sensitivity (i.e., small values of the 
false alarm probability). Because the optimized ASP method 
25 is tuned to the specific data distributions exhibited by the 
residual signals, the empirical false alarm rate for the ASP 
calculation satisfies the theoretical upper bound for all 
values of the preassigned false alarm probability. The ASP 
method produces fewer false alarms than the SPRT method 
30 for all values of the preassigned false alarm probability. 
ASP Benefits and Applications 

The Adaptive Sequential Probability (ASP) technique was 
developed as the fault detection element of a software 
program that reliably detects signal data faults for an asset, 
35 such as a process and/or apparatus. These signal validation 
modules improve safety, reduce maintenance cost, and 
enable optimal performance for a wide range of 
aeronautical, industrial, chemical, power generating, 
medical, biological, financial, and military assets. Signal 
40 validation is required in all types of process critical control 
and safety systems where unexpected process interruptions 
due to sensor or control component failures or false alarms 
are unsafe or uneconomical. Signal validation assures the 
safe, reliable operation of a process or apparatus and reduces 
45 the manpower, schedule and uncertainty associated with 
sensor and component failure detection. Signal validation 
prevents a facility safety or control system from making 
critical decisions, such as the decision to shut down a 
process or abort a mission, on the basis of bad sensor data. 
50 Signal validation improves process quality and efficiency by 
ensuring that closed loop control or data analysis is per- 
formed using good data. Finally, signal validation increases 
system up time and decreases system maintenance cost by 
enabling sensor calibration using on-condition criteria rather 
55 than time-in-service criteria. 

Moreover, having thus described the invention, it should 
be apparent that numerous structural modifications and 
adaptations may be resorted to without departing from the 
scope and fair meaning of the instant invention as set forth 
60 hereinabove and as described herein below by the claims. 

We claim: 

1 . A method for performing surveillance of an asset, the 
steps including: 

acquiring residual data correlative to expected asset 
65 operation; 

fitting an equation to said acquired residual data and 
storing said fitted equation in a memory means; 
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collecting a current set of observed signal data values 
from the asset; 

using said fitted equation in a sequential hypotheses test 
to determine if said current set of observed signal data 
indicates unexpected operation of the asset that is 5 
indicative of a fault condition; 

outputting a signal correlative to a detected fault condition 
for providing asset surveillance. 

2. The method of claim 1 wherein the step of acquiring 

residual data correlative to expected asset operation includes 10 
acquiring training data values correlative to expected asset 
operation and obtaining deviations between said training 
data values and values obtained by using a parameter 
estimation model wherein said parameter estimation model 
values are a function of said training data values. 15 

3. The method of claim 2 wherein the step of fitting an 
equation to said acquired residual data and storing said fitted 
equation in a memory means includes the step of fitting a 
general probability density function (PDF) to said acquired 
residual data such that said general PDF is a function of 20 
statistical moments of said residual data. 

4. The method of claim 3 wherein the step of fitting said 
general PDF includes the step of fitting a function of a first 
statistical moment of said residual data, a function of a 
second statistical moment of said residual data, and a 25 
function of at least one higher order statistical moment of 
said residual data. 

5. The method of claim 4 wherein said function of a first 
statistical moment of said residual data is a sample mean. 

6. The method of claim 4 wherein said function of a 
second statistical moment of said residual data is a sample 
variance. 

7. The method of claim 4 wherein said function of at least 
one higher order statistical moment of said residual data is 
a sample skewness. 

8. The method of claim 4 wherein said function of at least 
one higher order statistical moment of said residual data is 
a sample kurtosis. 

9. The method of claim 4 wherein said function of at least 
one higher order statistical moment of said residual data is 
a fifth or higher order statistical moment. 

10. The method of claim 3 wherein the step of fitting an 
equation to said acquired residual data and storing said fitted 
equation in a memory means includes the step of fitting a 
general probability density function (PDF) to said acquired 
residual data by fitting a standard Gaussian PDF to said 
acquired residual data and by then adding successive higher 
order terms to said standard Gaussian PDF until an adequate 
agreement between the so derived general PDF and the 
actual distribution of said acquired residual data is achieved. 

11. The method of claim 3 wherein the step of using said 
fitted equation in a sequential hypotheses test to determine 
if said current set of observed signal data indicates unex- 
pected operation of the asset that is indicative of a fault 
condition further includes the step of performing at least one 
of a group of four sequential hypothesis tests comprised of 
a positive mean test, a negative mean test, a nominal 
variance test, and a inverse variance test all as a function of 
said general probability density (PDF) function. 

12. The method of claim 11 wherein the step of perform- 
ing at least one of a group of four sequential hypothesis tests 
comprised of a positive mean test, a negative mean test, a 
nominal variance test, and a inverse variance test all as a 
function of said general probability density function (PDF) 
includes the step of processing said at least one of a group 
of four sequential hypothesis tests and comparing an out- 
come of each processed test to upper and lower threshold 
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limits for determining unexpected operation of the asset that 
is indicative of a fault condition. 

13. A surveillance system for monitoring an asset, said 
system comprising in combination: 

a data acquisition means for acquiring a current set of 
signals engendered from said asset correlative to asset 
status; 

a digitizing means for digitizing said current set of signals 
for defining a current set of digitized signals; 
a parameter estimation means for producing a set of 
estimated signal values as a function of said current set 
of digitized signals; 

a fault detection means for detecting the presence of a 
fault as a function of said set of estimated signal values 
and said current set of digitized signals; 
said fault detection means including a stored fault detec- 
tion model comprised of an empirically derived prob- 
ability density function utilized in a sequential prob- 
ability test of at least one result of a mathematical 
operation between said set of estimated signal values 
and said current set of digitized signals; 
a communication means for communicating detected 
faults to a remote location for providing asset surveil- 
lance. 

14. The system of claim 13 wherein said parameter 
estimation means for producing said set of estimated signal 
values is of a type individually selected from a group 
comprised of a multivariate state estimation technique 
method, a neural network method, a mathematical process 

30 model method, a autoregressive moving average method, 
and a Kalman filter method. 

15. The system of claim 14 wherein said empirically 
derived probability density function utilized in said sequen- 
tial probability test is obtained by acquiring training data 

35 values correlative to expected asset operation and obtaining 
residual data by computing differences between said training 
data values and values obtained by using said parameter 
estimation model on said training data values and fitting a 
general probability density function (PDF) to said residual 
40 data. 

16. The system of claim 15 wherein said fitted general 
PDF is a function of statistical moments of said residual 
data. 

17. The system of claim 16 wherein said general PDF is 
45 a function of a first statistical moment of said residual data, 

of a second statistical moment of said residual data, and at 
least one higher order statistical moment of said residual 
data. 

18. The system of claim 17 wherein said first statistical 
50 moment of said residual data is a sample mean. 

19. The system of claim 17 wherein said second statistical 
moment of said residual data is a sample variance. 

20. The system of claim 17 wherein said at least one 
higher order statistical moment of said residual data is a 

55 sample skewness. 

21. The system of claim 17 wherein said at least one 
higher order statistical moment of said residual data is a 
sample kurtosis. 

22. The system of claim 17 wherein said at least one 
60 higher order statistical moment of said residual data is a fifth 

or higher statistical moment. 

23. The system of claim 15 wherein said general prob- 
ability density function (PDF) of said acquired residual data 
is comprised of a standard Gaussian PDF and at least one 

65 higher order term to said standard Gaussian PDF. 

24. A method for training a plurality of process models 
wherein each process model represents a set of parameter 
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estimation models and a set of fault detection models of an 
asset to be monitored by a surveillance system, the steps 
including: 

acquiring at least one set of signal values associated with 
a current status of said asset; 

forming a training data set comprised of at least one said 
set of signal values; 

providing a memory means for storing and retrieving said 
training data set; 

processing said training data set comprised of at least one 
said set of signal values associated with said current 
status of said asset for training at least one parameter 
estimation model; 

estimating at least one set of process parameters by using 
at least one said parameter estimation model for pro- 
cessing said training data set; 

empirically fitting a general probability density function 
to at least one said set of signals values in said training 
data set as a function of at least one said estimated set 
of process parameters and said training data set for 
obtaining at least one empirically derived fault detec- 
tion model; and 

storing within said memory means said at least one said 
parameter estimation model and at least one said 
empirically derived fault detection model for subse- 
quent use in surveillance of said asset. 

25. A method for performing surveillance of an asset, the 
steps including: 

numerically fitting a probability density function to a 
distribution correlative to normal operation of the asset; 

acquiring a current set of observed signal data values from 
the asset; 

calculating a current set of estimated signal data values 
from a stored model for said asset; 

transforming said calculated current set of estimated 
signal data values with said current set of observed 
signal data values for defining residual data values; 

utilizing said fitted probability density function in a 
statistical hypothesis test performed on said residual 
data values for detecting a fault condition; and 
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outputting a signal correlative to each detected fault 
condition for providing asset surveillance. 

26. The method of claim 25 ether including a step of 
forming said stored model. 

5 27. The method of claim 26 wherein the step of forming 

said stored model includes a step of preparing said stored 
model using historical operating data from the asset. 

28. The method of claim 27 wherein the step of preparing 

10 said stored model using historical operating data from the 

asset further includes a step of creating a training data set 
having discrete observations correlative to the expected 
normal operation of the asset wherein each single observa- 
tion is comprised of a vector of data values from a digitized 

15 set of signals correlative to each signal parameter to be 
included in said stored model. 

29. The method of claim 28 wherein said step of forming 
said stored model includes a step of creating a parameter 
estimation model by training said stored model using said 

20 training data set. 

30. The method of claim 25 wherein said step of numeri- 
cally fitting a probability density function to a distribution 
correlative to normal operation of the asset includes a step 

25 of preparing said fitted probability density function using 
historical operating data from the asset. 

31. The method of claim 30 wherein the step of preparing 
said fitted probability density function using historical oper- 
ating data from the asset further includes a step of creating 

30 a training residual data set having discrete residual data 
values correlative to the expected normal operation of the 
asset wherein each residual data value is comprised of a 
deviation between a historical operating data signal value 
and a estimated signal value produced by said stored model 

35 for said asset. 

32. The method of claim 31 wherein said step of forming 
said fitted probability density function includes a step of 
numerically determining an equation correlative to said 
training residual data set having discrete residual data values 

40 correlative to the expected normal operation of the asset. 





